Model Selection

Long Text Support

# Long Text Support

Ruri v3 is a Japanese general-purpose text embedding model based on ModernBERT-Ja, supporting sequences up to 8192 tokens long and achieving state-of-the-art performance in Japanese text embedding tasks.

Text Embedding Japanese

Embedder Collection

Multilingual embedding model for German and English, supporting a context length of 8192 tokens

Text Embedding Supports Multiple Languages

Khmer Mt5 Summarization 1024tk V2

An improved Khmer text summarization model based on mT5-small, supporting inputs of up to 1024 tokens, suitable for summarizing Khmer articles, paragraphs, or documents.

Text Generation

Transformers Other

Inf Retriever V1 1.5b

INF-Retriever-v1-1.5B is a dense retrieval model based on large language models developed by INF TECH, optimized and fine-tuned for Chinese-English data retrieval tasks.

Transformers Supports Multiple Languages

Snowflake Arctic Embed L V2.0 Gguf

Snowflake Arctic-embed-l-v2.0 is the latest embedding model released by Snowflake, specifically designed for multilingual workloads, optimizing retrieval performance and inference efficiency.

Text Embedding Supports Multiple Languages

Snowflake Arctic Embed L V2.0 GGUF

The GGUF quantized version of Snowflake Arctic Embed L v2.0 is an efficient multilingual text embedding model, suitable for high-quality retrieval tasks.

Ruri is a Japanese universal text embedding model, focusing on sentence similarity calculation and feature extraction, with support for long text processing.

Text Embedding Japanese

Ruri-Large is a high-performance embedding model specialized in Japanese text similarity calculation, based on transformer architecture with support for long text processing (maximum length 8192).

Safetensors Japanese

Ruri is a model specialized in Japanese text embedding, capable of efficiently calculating sentence similarity and extracting text features.

Text Embedding Japanese

Ruri is a universal text embedding model for Japanese, focusing on sentence similarity and feature extraction tasks.

Text Embedding Japanese

Gte Multilingual Reranker Base

The first multilingual reranking model in the GTE series, supporting 70+ languages with high performance and long text processing capabilities.

Transformers Supports Multiple Languages

This is the ONNX quantized version of the BAAI/bge-m3 model, supporting three functionalities: dense retrieval, multi-vector retrieval, and sparse retrieval, covering over 100 languages.

Chinese Llama 2 7b

Chinese-LLaMA-2-7B is an extended Chinese version of Meta's Llama-2 model, optimized with an expanded Chinese vocabulary and incremental pre-training to enhance Chinese comprehension.

Large Language Model

Transformers Supports Multiple Languages

Mlong T5 Large Sumstew

This is a multilingual, long-text (supports up to 16k input tokens) abstractive summarization model. Trained on the sumstew dataset, it can generate titles and summaries for given input documents.

Text Generation

Transformers Supports Multiple Languages

PEGASUS is a pretrained model based on gap sentence extraction, specifically designed for abstractive text summarization tasks.

Text Generation

Transformers English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase